Skip to content

robinvndaele/NoisyDistances

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This folder contains the code for the paper 'The Curse Revisited: When are Distances Informative for the Ground Truth in Noisy High-Dimensional Data?', presented at The 25th International Conference on Artificial Intelligence and Statistics (AISTATS), 2022.

arxiv submission: https://arxiv.org/abs/2109.10569.

Setup

Enviroments

  • R (Rstudio)
  • Python (Jupyter notebook)

R requirements

  • ggplot2 (general plotting)
  • igraph (working with graphs)
  • ggpubr (combining plots)
  • FNN (fast k-NN graph computation)
  • parallel (parallel processing in R)
  • umap (umap dimensionality reduction)
  • diffusionMap (diffusion map dimensionality reduction)
  • rpca (Robust PCA dimensionality reduction)
  • dimRed (Isomap dimensionality reduction)

Python Requirements

  • numpy (handling arrays)
  • pandas (handling data frames)
  • pytorch (neural networks in Python)
  • matplotlib (plotting)

Datasets

Run

  • Folder "Scripts": contains the code (R) for producing the cell trajectory visualization prior to the experiments section in the main paper.
  • Folder "Experiments": contains all code (R + Python) for producing the results in the experiments section of the main paper, with files named accordingly.

R

  • Open file in Rstudio
  • Source --> Source with Echo (ctrl + shift + enter).

Python

  • Open file in Jupyter notebook
  • Cell --> Run Cells (ctrl + enter).

Results

  • Folder "OutputPDF": contains all output by code block in PDF format, with files named accordingly.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published